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ABSTRACT 

Context. Star clusters are studied widely both as benchmarks for stellar evolution models and in their own right. Cluster age distribu- 
tions and mass distributions within galaxies are probes of star formation histories, and of cluster formation and disruption processes. 
The vast majority of clusters in the Universe is small, and it is well known that the integrated fluxes and colours of all but the most 
massive ones have broad probability distributions, due to small numbers of bright stars. 

Aims. This paper goes beyond the description of predicted probability distributions, and presents results of the analysis of cluster 
energy distributions in an explicitly stochastic context. 

Methods. The method developed is Bayesian. It provides posterior probability distributions in the age-mass-extinction space, using 
multi- wavelength photometric observations and a large collection of Monte-Carlo simulations of clusters of finite stellar masses. The 
main priors are the assumed intrinsic distributions of current mass and current age for clusters in a galaxy. Both UBVI and UBVIK 
data sets are considered, and the study conducted in this paper is restricted to the solar metallicity. 

Results. We first use the collection of simulations to reassess and explain errors arising from the use of standard analysis methods, 
which are based on continuous population synthesis models: systematic errors on ages and random errors on masses are large, while 
systematic errors on masses tend to be smaller. The age-mass distributions obtained after analysis of a synthetic sample are very 
similar to those found for real galaxies in the literature. The Bayesian approach on the other hand, is very successful in recovering the 
input ages and masses over ages ranging between 20 Myr and 1.5 Gyr, with only limited systematics that we explain. 
Conclusions. Taking stochasticity into account is important, more important for instance than the choice of adding or removing near- 
IR data in many cases. We found no immediately obvious reason to reject priors inspired by previous (standard) analyses of cluster 
populations in galaxies, i.e. cluster distributions that scale with mass as and are uniform on a logarithmic age scale. 

Key words. Galaxies: fundamental parameters, photometry, star clusters, stellar content — Methods: data analysis — Techniques: 
photometric 



1. Introduction 

In the early decades of astrophysics, star clusters have been our 
main key to the understanding of stellar evolution. While clusters 
continue to provide precious constraints on stellar physics, they 
' are today studied in their own right and as tracers of the histories 
, of galaxies. It has become clear that a significant fraction of star 
' formation occurs in clusters, and tha t events such as interacting 
I galax i es can trigger their form ation (Harris 1991; Meurer et al. 
119951: iBarton. Geller. & Kenvo n 2000; Di Matteo et al. 2007). 
Questions have been raised regarding the IMF in clusters in var- 
ious environments, about the systematic trends in their colour 
distributions, about their lifetimes as gravitationally bound ob- 
jects and about the initial and current cluster mass functions. 

Resolved observations of individual stars remain the most 
precise way of investigating the nature of clusters and will be 
possible out to distances of 10 Mpc with future extremely large 
telescopes. However measurements of the integrated light of un- 
resolved star clusters reach far beyond this scale already today, 
and will remain the path of choice for the studies of large sam- 
ples. 

All our studies of individual clusters and of cluster pop- 
ulations in galaxies rest on our ability to estimate their cur- 
rent ages, masses and metallicities, while accounting for ex- 
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tinction. The standard method of analysis of integrated clus- 
ter light is based on the direct comparison of the observed 
colours with predictions from continuous population synthesis 
models. These models predict fluxes with the assumption that 
each mass bin along the stellar mass function (SMF) is popu- 
lated according to the average value given by this SMF. Studies 
based on continuous population synthesis models have led to 
results that have a large impact on today's description of clus- 
ter "demographics". For instance, it is now usually admitted 
that the current cluster mass function dec reases with mass as 
a power law with an index close to -2 (Zhang &Falll 1 19991: 
iBik et al.|[2003l: iBoutloukos & Lamersll2003h and the debate on 
the cluster survival rate also rests on dist r ibutions obtained us- 
ing continuous models (Fv^sperinil [79981: iFall & ZhangI l200ll: 
iLada & Ladall2003l: iRafelski & Zaritskvll2005h . The continuous 
approach has been coupled with statistical data analysis, for in- 
stance to provide the impression that including near-IR photom- 
etry (K band) solves the age-metallicity degeneracy for clusters 
(iGoudfrooii et al.1 l200ll : iPuzia et al.l l2002l: lAnders et al.1 l2004l: 
iBridzius et al.l 1200 8'). Still in the context of continuous popula- 
tion synt hesis, ICid Fernandes & Gonzalez De lgado ( 2010) fol- 
lowed by lGonzalez Delgado & Cid Fernandei dMlOl) developed 
a Bayesian analysis of the integrated spectra of star clusters. 

The continuous population synthesis models are strictly 
valid only in the limit of a stellar population containing an 
infinite number of stars. Real clusters, however, count a fi- 
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nite number of stars. Furthermore most of the light is pro- 
vided by a very small number of bright stars, in particular in 
the near-IR. The so-called stochastic fluctuations in the inte- 
grated photometric properties are the result of the random pres- 
ence of these luminous stars. Some of these can be quantified 
using selected inform ation provided by continuous population 
synthesis models (e.g . Lancon & Mouh cin3l200Ql: ICervino et al.l 
I2Q02I: ICervino & Lurid iana 2004, 2006i), bu t others require the 
use o f discrete popula tion s ynthesis models (fearbaro & Bertelh 
19771: iGirardi & Bicalll993l:lBruzualll2002l:lDeveikis et al.ii2008i: 



Popescu & HansonI 120091 : iPiskunov etani2009l) . The predicted 
luminosity and colour distributions depend strongly on the to- 
tal mass (or star number) in the cluster, and can be far from 
Gaussian even when the total mass exceeds 10^ M©. The most 
probable colours are off'set from those predicted by continuous 
population synthesis when masses are below 10"^ M©, because 
the single most luminous star in such clusters will be more often 
on the main sequence than in the red giant phases of evolution. 
Attempts to describ e the colour distributions a nalytically have 
made progress (e.g. ICervino & Luridiana|[20Q6b . but are not yet 
easily applicable. 

The present piece of work is based on discrete popula- 
tion synthesis. For the first time, we use the discrete models 
not only to predict colour distributions but to analyse the en- 
ergy distributions of clusters. We present a Bayesian approach 
to the probabilistic determination of age, mass and extinction, 
based on a large library of Monte-Carlo simulations of clus- 
ters. This method is a close analog to the one introduced by 
iKautfmann et al.l (l2003l) for the study of star formation histories 
in the Sloan Digital Sky Survey. However the variety of observ- 
able properties has completely diff'erent origins in both contexts: 
stochasticity at a given age, mass and metallicity plays a pre- 
dominant role here, while diff'erent star formation histories pro- 
vide all the diversity in the model collections used for galaxy 
studies. We compare determinations based on the Bayesian ap- 
proach with traditional estimates, thus providing a new insight 
into systematic eff'ects and their consequences. In this first pa- 
per, we focus on data sets consisting of either UB VI or UB VIK 
photometry. Future work will extent to other pass-bands and the 
addition of the metallicity dimension. 



2. Synthetic populations 

The analysis of cluster colours must be based on synthetic spec- 
tra that explicitly account for the random fluctuations due to 
small numbers of bright stars. 

We have constructed large catalogs of synthetic clusters us- 
ing Monte-Carlo (MC) methods to populate the Stellar Mass 
Function (SMF) with a finite number of stars. For the pur- 
poses of this paper, all synthetic clusters host simple stellar 
populations (SSP), i.e. their stars are coeval and have a com- 
mon initial composition. The synthetic clusters are generated 
with a discrete population synthesis co de we derived from 
Pegase (iFioc & Rocca-Volmerangdri997l) . Stellar evolution at 
solar me tallicity is modeled with the evolutionary tracks of 
iBressan e t^l. (1993 ). The inpu t stellar spectra are based on the 
library of iLejeune et al.l ((1998!), as we will be intereste d mostly 
in bro ad band photometry here. The SMF is taken from lKroupal 
(I2OOII) . It extends from 0.1 to 120 M©. Nebular emission (lines 
and continuum) is included in the calculated spectra of young 
objects under the assumption that no ionizing photon escapes. 
When extinction c orrections are considered, they are based on 
the standard law of lCardelH. Clavton. & MathisI (Il989h . 



Our current collection of MC-clusters contains two types of 
catalogs. 

The first set of catalogs contains collections of clusters with 
equal numbers of stars. In these (earlier) catalogs, the cluster 
ages take 69 values that are distributed on an approximate log- 
arithmic scale between 1 Myr and 20 Gyr. Metallicity is solar 
(Z= 0.02). These catalogs are available for clusters of 10^, 3.10^, 
6.10^ 10^, 3.10^, 6.104,10^ stars. They contain a total of 69,000 
clusters (1000 for each of the 69 time steps). 

Most of the results in this paper are based on a second cat- 
alog, which consists of 1.5.10^ clusters with ages, taking 309 
values, distributed between 1 Myr and 20 Gyr, with masses 
above about 500 M© and with metalHcities Z=0.008, Z=0.02 
and Z=0.05. We will only discuss solar metallicity here. The 
distribution of ages is flat on a logarithmic scale above 20 Myr 
(younger ages are currently under-represented). The number 
of stars in a given cluster is drawn randomly from a power 
law distribution with index -2. As a result, the mass distri- 
bution of the clusters in the sample fall ofl' approximately as 
M"^ (note that M is the current stellar mass of the cluster, not 
its initial mass). These age and mass distributions were cho- 
sen as_a_Q0Ssibleje2resei^^ distributions in galax- 
ies (iFall. Chandar. & Whitmord[2009l) . although we caution that 
empirical determinations in the current literature are based on 
non- stochastic studies. At this point, the distribution simply 
has the convenient feature that it includes many small clusters, 
those for which the distributions of predicted properties are most 
complex. It would be a significantly larger computational chal- 
lenge to produce a collection with a flatter distribution and a 
similar number of small clusters. On the other hand, the current 
collection includes only a small number of very massive clusters. 
Although the properties of the latter are more well-behaved, we 
consider the catalog incomplete above 2.10^ M© and focus on 
smaller clusters for the time being. 

The difl'erences between the properties of discrete popula- 
tions and the standard predictions from continuous populations 
synthesis are generally larger than standard observational errors. 
This becomes particularly true in the near-infrared bands, and 
for young and intermediate ages. It requires clusters with sev- 
eral lO^M© to narrow down the stochastic fluctuations to 5% in 
the K-band (iLancon et al.ll2008l) . Figure [T] illustrates the scatter 
of observable properties associated with some of our synthetic 
cluster catalogs, and allows direct comparison with standard pre- 
dictions from continuous synthesis. On the left panel, the lower 
luminosity clusters all contain 10^ stars (with the assumed SMF, 
their mass distribution is peaked around 500 M©), while the 
higher luminosity clusters contain 10^ stars. It is clear that the 
distributions are highly mass-dependent. Most small mass clus- 
ters have no post-main sequence star, because the average num- 
ber of such stars (given by continuous population synthesis) is 
lower than one. The model density map in the right panel repre- 
sents our main catalog. Due to the distribution of star num- 
bers and to stellar lifetimes in various evolutionary phases, is- 
lands of high model density are present. Distributions in other 
colour-colour and colour-magnitude diagrams can be found in 
later sections. 



3. Analysis methods 

The above synthetic populations can be used to analyse photo- 
metric observations of clusters. The properties we seek to es- 
timate in this paper are the cluster ages and masses. We wish 
to account for the fact that observed colours can be afl'ected by 
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Fig. 1. Stochastic properties of star clusters at solar metallicity. On the left panel, the dots represent photometric properties of 
individual clusters containing 1 0^ and 10^ sta rs each. The solid lines show the corresponding age sequences from "standard" 
predictions (Figure inspired by iBruzua On the right panel, we show the corresponding density distribution of models 

in our main catalog, constructed assuming a power law with a index -2 for the cluster mass function. Note that the general aspect of 
this and subsequent model density maps will not change if the total number of clusters is increased, as required to extend the present 
study to higher cluster masses. 



unknown amounts of extinction. Accounting for uncertainties in 
the metallicity is postponed. 

In this section, we describe the Bayesian method developed 
for the analysis. Two other analysis methods are briefly de- 
scribed for comparison. One is a simple best-;^^ fit to all the data 
in the synthetic cluster catalog, the other is the usual estimate 
based on continuous population synthesis predictions. The com- 
parison between results obtained with the three methods pro- 
vides important insights into systematic eff'ects. 

3.1. Standard estimates: the "infinite limit" 

The standard procedure implicitly considers the mass distribu- 
tion of the stars in a cluster to be continuous. It has been applied 
widely, but as already mentioned, it is strictly valid only for pop- 
ulations of infinite mass. It becomes a reasonable approximation 
for clusters with masses above an age-dependent limit of about 
5.10^ Mq. We include this method mainly to quantify the errors 
produced when it is applied to analyse the light of clusters of 
lower masses. 

In families of spectra predicted with continuous models, 
mass is a simple scaling factor that applies to all fluxes. 
Therefore model spectra are frequently published scaled to a to- 
tal population mass of 1 M©. Spectra are described only by age 
and extinction (and metallicity). 

Assuming observational errors are Gaussian and indepen- 
dent, the most likely continuous model for a given set of broad- 
band fluxes is a minimum of the following function: 



){ c 



(1) 



where Y = {Yk}ke[i..n] are the available data and the CTk the cor- 
responding observational uncertainties. Yj^^j^^ are the fluxes pre- 
dicted for the model Moo (total mass 1 M©) and M is the mass 
required to optimize the fit with this model. 

We allow for extinction by looping through positive values 
of Ay and repeating the optimization procedure. 



3.2. Bayesian Estimates 

As opposed to its continuous analogue, this approach explic- 
itly accounts for the discrete and stochastic nature of the stel- 
lar populations. The cost in computation time is rather high, as 
the c ollecti ons of synthetic clusters explored must be large (see 
Sect. [6231). 

As in all Bayesian approaches, the results are stated in prob- 
abilistic terms, and they depend on a priori probability distribu- 
tions of some model parameters. In our case, the most probable 
ages and masses for a cluster, given a set of photometric observa- 
tions, will depend on the age distribution and mass distribution 
of the synthetic clusters in the model catalog (the statement can 
be extended to include extinction and metallicity). 

We assume observational errors are Gaussian with 
known standard d e viatio ns (a preliminary study of 
iLancon & Fouesneaul (l2009l) used boxcar functions for the 
error distributions). An intrinsic model property X, for given 
photometric values Y, has the probability: 



P(X\Y)ocp(Y\X)xP(X). 



(2) 



Since errors are assumed to be Gaussian, the probability P(Y\X) 
can be expressed using the usual statistic: 



P(Y\X) oc e 



(3) 



Then, the probability distribution of an intrinsic property, such as 
the age or mass of an individual cluster, is given by the follow- 
ing relation. The probability for property X to be located in an 
interval [xi, X2], given photometric measurements Y = {Yk}kE[i..n] 
with uncertainties ct;^, is: 



P( X e [xuX2]\Y) = a 



z n 



Mi /X(Mi)E[Xi,X2] 



4 



(4) 



271(tI 



In the above, a is the normalization constant (the value of which 
we calculate). The sum extends over all models that have an ad- 
equate value of X. P(Mi) is the probability assigned to an in- 
dividual model. Unlike the continuous models, the mass is an 
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intrinsic parameter of each modelled population. As a first step, 
we consider all models in our main catalog equally probable, 
and therefore we inherit the distributions of age and mass used 
to construct the catalog. When we vary extinction, we consider 
flat probability distributions for Ay between two boundaries. 
Through the factors P{Mi), this expression of the probability can 
embody any prior mass or age distribution. 

An analog of Eq. |4]can be written for joint probability dis- 
tributions, for instance for age, mass and extinction. The most 
probable ages and masses given in this paper are the age and 
mass coordinates of the maximum of the joint distribution. Error 
bars can be given by examining all models above a given proba- 
bility threshold. 

In our main catalog, as we mentioned before, the prior mass 
function is a power law with an index -2, and log(age) is dis- 
tributed uniformly (above 50 Myr). Clearly, it will be neces- 
sary to investigate the eff'ects of these assumptions quantitatively, 
by varying them within reasonable limits. Due to the above de- 
scribed catalog mass completeness limit, we postpone this study 
to a future work. 

3.3. Single best fit 

As a simple first step towards accounting for stochastic fluctu- 
ations, one may look at the one model in the catalog that mini- 
mizes the standard function 



=2 



(5) 



4. Analysis of UBVIK photometry 

As a first study, we compare the results from the three meth- 
ods above in the joint analysis of U, B, V, I and K band fluxes. 
The analysis is applied to the synthetic absolute magnitudes of a 
subsample of clusters from our main catalog. The presentation is 
twofold: in Sect . 14 . 21 and 14.31 the synthetic magnitudes are used 
unchanged, while in Sect. l4.4l noise is added before the analysis 
is performed. 



4.1. Input sample description 

The comparison of mass and age estimates is presented for clus- 
ters with relatively small masses, where our catalog is most com- 
plete and where the efl'ects of stochasticity are most important. 
Figure [21 shows the mass-age distribution of our input sample. It 
contains 1000 models randomly selected from our main MC cat- 
alog. It covers a relatively small mass range, but a broad range 
of ages (though mostly above 10 Myr). The model colours are 
not reddened (unless otherwise stated) and the model cluster dis- 
tances are set to 10 pc - the observable properties associated 
with the models are absolute magnitudes or fluxes. Note that the 
selected sample reflects the intrinsic prior mass and age distribu- 
tions of the catalog. 



4.2. Estimates from photometry without noise 

The estimates of age and mass obtained from the synthetic 
data with the standard method and with the stochastic Bayesian 
method are shown in Fig. [3l Estimates from the single best fit 
method are not shown, as this method returns the input model 
itself when no noise is added to the synthetic data. The stan- 
dard deviations in the above equations ([B, (|4| and ([5]) are set 
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Fig. 2. Model sample from our main MC catalog. 1000 mod- 
els were randomly extracted from the catalog in order to iden- 
tify systematics. Selected clusters are low mass populations and 
cover a broad range of ages. Fluxes are not reddened and cali- 
brated as absolute fluxes, corresponding to a 10 pc distance. 



arbitrarily to 0.05 magnitudes. In this first experiment, the ab- 
sence of extinction, is considered to be known to the "observer". 
This could be constrained for instance with emission line mea- 
surements or with estimations of the total gas content of the host 
galaxy. The alternative experiment, where the extinction is a free 
parameter of the analysis is described in Sect. 14.31 

4.2.1 . Biases in the estimates from the standard method 

The bottom panels of Fig.[3]reveal the inadequacy of continuous 
population synthesis models for the analysis of the colours of 
realistic clusters of small masses. 

The fits to the UBVIK data are poor. 93% of the;^^ values 
are more than 3cr above the value expected for a good fit if one 
assumes observational errors of 0.05 magnitudes (cr refers to the 
standard deviation of a ;^^-law with 3 degrees of freedom). In 
other words, no statistically acceptable match is found in most 
cases. 

Cluster ages are typically underestimated, with a few excep- 
tions. Errors of 1 or even 2 dex are not rare. As a consequence, 
mass estimates are highly dispersed. And if one fails to reject 
clusters with poor fits, the masses are on average underestimated 
by 0.3 to 0.5 dex. 

Fig |4] helps us understand why derived ages tend to concen- 
trate around 5-9 Myr and 80 - 100 Myr, in features that we 
labeled ^6 and in Fig. [3l leaving gaps at other ages. We 
note that this remarkable eff'ect is seen in many cl uster m ass dis- 
tributions presented in the literatur e (e.g. Gieles 2009) and has 
received partial explanations (e.g. iFall. Chandar. & Whitmord 
I2005h . 

Since there is no extinction, the optimization will simply 
search for the nearest continuous model in the UBVIK-space. 
For intermediate age clusters, hooks in the locus of continu- 
ous models make individual ages more or less attractive. For 
instance, the supergiant evolution phase, which occurs around 
9 Myr in our models, is very attractive and translates into the 
feature in Fig. [3] The accumulation around 100 Myr (feature 
^6) results from the redward excursion of V-K when the up- 
per Asymptotic Giant Branch (AGB) first becomes populated. 
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Fig. 3. Estimations based on U, B, V, I and K band fluxes without noise, when assuming no extinction (Ay = 0, "true" value). 
Estimations from the Bayesian approach are given in the two upper panels and standard estimation considering an infinite number 
of stars are given in the two lower panels. The x-axes represent, respectively, the original age and mass of the sampled populations 
whereas the y-axes represent the estimated values from the two methods. Dashed lines highlight the identity function. Labels are 
referenced features within the text. Open circles highlight poor fits where the reduced;^^-values are larger than 3cr confidence range, 
standard deviation of the -law. 



The attractive eff'ect of these particular points is enforced by the 
fact that the spread of the models is significantly wider in V-I 
compared to U-B (Fig|4]): moving across wide parts of the dis- 
tribution in U-B has a lower cost in terms of variations than 
moving along U-B. 

Although in most cases the best values are not statisti- 
cally acceptable (again statistically 93% of the input-clusters are 
not well fitted), the match nevertheless provides a decent age for 
relatively old populations. 

Finally, objects containing more luminous stars than the con- 
tinuous models, tend to be very red. With no extinction, this 
leads to overestimates of the population ages, which produces 
feature[3]f on Fig. [3] 

Typically, young continuous models are more luminous, per 
unit mass, than older ones. Hence any trend in the age estimates 
will translate into the equivalent eff'ect on the derived masses. 
Therefore, this method tends to also underestimate masses of 
small clusters. 



4.2.2. Bayesian estimates 

Unlike the standard method, the Bayesian analysis based on 
stochastic cluster catalogs returns ages and masses in excellent 
agreement with the input values as shown by the top panels of 
Fig. [3] As already mentioned, the single best fit model is ex- 
actly the model used to synthesize the photometry for the anal- 
ysis. Shown here are the most probable model properties, after 
a Bayesian analysis that assumes observational errors of 0.05 
magnitudes (although no noise was actually added to the input 
photometry). 

The masses assigned with the Bayesian method are dis- 
tributed symmetrically around the input values, except for a cut- 
off' at 500 M© due to the low-mass limit of our cluster catalog. 
About 20% of the analysed clusters are potentially aff'ected by 
this limit. 

The standard deviation of the residuals are of 0.1 dex in age 
above 20 Myr and 0.08 dex in mass above 10^ M© (the age and 
mass lower limits are set to avoid statistical biases due to the 
current limitations of our MC-catalog). 
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Fig. 4. Photometric properties of stochastic populations compared to average predictions at solar metallicity. On the left hand panels, 
the dots represent coulours of individual clusters contained in our MC catalog. Note that there are large regions of overlap between 
age bins, and that older models hi de younger models in certain area s of the figure. The extinction vector is given by the arrows 
assuming a Cardelli extinction law (ICardelli. Clayton. & Mathisll 19891) . On the right panels, the corresponding density distributions 
of the models are shown, assuming cluster masses follow a power law with an index -2. The solid lines represent the corresponding 
age sequences from "standard" predictions. Numeric labels on the bottom left panel correspond to the age sequence from 1 Myr to 
20 Gyr. Dotted lines emphasize high density regions mentioned in Sect. 14.21 



4.3. When extinction is a free parameter 

In the previous section, cluster colours were analysed by com- 
parison with models during which the amount of reddening im- 
posed to be null in the optimisation. 

We have shown that in most of cases the standard analysis 
method leads to under-estimated ages, and masses, even with 
high quality photometric data across the spectrum from U to K. 
The fraction of clusters with underestimated ages was signifi- 
cantly reduced when adopting stochastic models rather than the 
classical continuous ones. 

We now assume that no independent information on extinc- 
tion is available to the "observer". Hence for the analysis we 
allow the extinction. Ay, to vary between and 3 with steps of 
0.2. 

Figure [5] shows the results obtained when repeating the anal- 
ysis of the reddening-free sample of Sect. 14.11 this time autho- 
rizing any reddening of the models. The colours of a reddened 
sample are analysed in Sect. 14.3.31 



4.3.1 . Estimates from the standard method 

The bottom panels of Fig. [5] confirms the inadequacy of con- 
tinuous population models for the analysis of realistic clusters 
of small masses. By authorizing the extinction to vary, we do 
increase the fraction of clusters whose colours can be fitted sat- 
isfactorily with continuous models, but only to 22 %. 

Again assigned ages are highly clustered, and most of them 
are underestimated. As a consequence, mass estimates are highly 
dispersed. 

In the standard method, dereddening is the only cost-free 
way, in terms of;^^, to move an observed cluster to the line repre- 
senting the continuous models. If we look at the UBVI or UBVK 
colour-colour planes shown on Fig. |4] (left panels), the redden- 
ing vector is almost orthogonal to the loci of stochastic clusters 
of constant age. Recall that the input sample is drawn from the 
stochastic collection even if it is analysed with the continuous 
models. Dereddening therefore leads to underestimated ages. 

Derived ages tend to concentrate around around 5-9 Myr, 
80 - 100 Myr and also 1 Gyr, figures that are labeled [Ha, [11/5 
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Fig. 5. Estimates based on U, B, V, I and K band fluxes without noise, allowing an extinction Ay from to 3. The conventions and 
axis significations are the same as Fig.O On the left-hand side, the age estimations. On the right-hand side, the mass estimations. 



and [sly in Fig.[5l Using the direction of the extinction vector in 
Eig. |4] and starting from the bending points of the line of con- 
tinuous models, one can define large areas of the colour-colour 
planes in which all clusters will be assigned the same ages (with 
fits of medium or poor quality). Eor instance, all clusters with 
-0.5 <U-B< 0.3 and V-K< 1.5 (or V-I< 0.2) (clusters in 
the yellow or blue age bins and below the continuous model 
line) will be assigned ages of 5 - 7 Myr, which causes feature 
very similar to the previously described Oy. Old clusters 
are preferentially located on two branches of high model den- 
sity in our (prior dependent) MC-model collection. Those two 
regions, emphasized by the dotted lines on the right panels of 
Fig- in are crossing each other around U-B= 0. Clusters in the 
lower branch, with the higher model density (at V-K~ 2), will 
be assigned an age of 1 Gyr and produce feature[5]a. The older of 
these will be assigned higher extinction values. Clusters near the 
second and more diagonal branch (V-K> 2.5) will be assigned 
ages between 100 Myr and 1 Gyr, with a systematic trend to- 
wards younger ages for redder, older clusters (feature OyS). Note 
that the accumulation [3]^, described previously, is still present 
but weaker. EeatureOf has disappeared. 

Again, young continuous models are typically more lumi- 
nous, per unit mass, than older ones. In most cases, this age ef- 
fect is stronger than the luminosity variation associated with the 



dereddening procedure. Therefore, obtained masses are usually 
smaller than the actual masses. There are exceptions however, 
for clusters associated with large estimated (i.e. overestimated) 
amounts of extinction. 

4.3.2. Bayesian estimates 

Despite the additional degree of freedom associated with extinc- 
tion, the Bayesian analysis based on stochastic cluster catalogs 
generally returns ages and masses in good agreement with the 
input values (top panels of Eig. [5]) . 

Above 20 Myr, 90 % of the most probable ages lie within 
0.3 dex of the actual age (r.m.s. dispersion of 0.1 dex). Eor the 
remaining 10% of the clusters in the sample, the age estimates 
present attraction points again, and this leads to underestimates 
of up to 1.5 dex. The favoured ages are located just below 1 Gyr 
and around 100 Myr. The corresponding features are labeled 3. a 
and 3.b in Eig. [5] While their locations are not far from those of 
3. a and 3.13, their origins are diff'erent. 

The Bayesian "attractors" are regions of high concentration 
in the model density plots of the right panels of Eig. |4l They do 
not lie along the locus of continuous population synthesis mod- 
els. Since all models with magnitudes within about 0.05 of the 
observations are given large weights in the computation of the 
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posterior probability distribution, areas of high model density 
along the dereddening line are favoured. An illustration is pro- 
vided in Fig. lA.2l and discussed further in Sect. 16.21 The clusters 
affected by this issue are usually located in regions of low model 
density of the colour-colour and colour-magnitude planes; they 
are assigned positive extinction values. 

4.3.3. Reddened input clusters 

As the extinction seems to play a role in the resulting trends 
of derived ages and masses from integrated photometry anal- 
yses, we conduct the same experiment as before, where the 
input sample photometric dataset was reddened by a factor 
of 1 optical magnitude, assumin g a Cardelli extinction law 
(ICardelH. Clavton. & Mathisl ll989^ 

Fig.[6]presents the resulting age and mass distributions of the 
derived properties when using the Bayesian method. 97% of the 
clusters have a correct derived extinction of 1 mag according to 
the Bayesian estimates. The standard deviations of the residuals 
are of 0.2 dex in age above 20 Myr and 0.1 dex in mass above 
10^ Mq. Trend features presented on Fig. [6] are very similar to 
the previously mentioned ones. A few clusters now have overes- 
timated ages, because reddening has moved them from the blue 
side to the red side of a region of high model density and typ- 
ically older ages. This region of high model density becomes a 
Bayesian attractor for these reddened clusters, while it was out of 
bounds in the reddening-free case (reaching this region of high 
model density would have required negative extinction correc- 
tions). 

4.4. Estimates from noisy fluxes 

We now reproduce the analysis of Sect. l4.3l for the same inputs, 
but after having added Gaussian noise to the input fluxes (5 %). 
Figure [7] presents the results. At the top of this figure are also 
plotted estimates derived from the single best-;^^ fit. 

Although the inputs are now noisy, the trends obtained re- 
main similar. In particular, there is no important diff'erence with 
the noise-free case when the analysis is based on standard, con- 
tinuous population synthesis models (bottom panels): most fits 
are poor, ages tend to be underestimated, masses are highly dis- 
persed around the correct values. 

Bayesian estimates remain close to the expected values even 
though the dispersion has increased. The standard deviation of 
the residuals are of 0.15 dex in age and 0.13 dex in mass ex- 
cept for models in 6.a and 6.b features. Only 13 % of the model 
clusters are assigned underestimated ages. 

A pleasant fact is that the single best-;^^ fit provides re- 
sults that are similar to the Bayesian ones (see discussion in 
Sec. 16.2.11) . Although individual clusters are assigned diff'erent 
ages, the estimated properties of the sample as a whole are de- 
scribed similarly with both methods. For 84 % of the clusters, the 
single best fit age and the Bayesian ages are both within 0.3 dex 
of the actual ages. Of the 16 % that deviate, 3 % deviate only 
with the single best fit method, 3 % only with the full Bayesian 
method, and 10 % deviate in similar ways with both methods. 

5. When there is no K band data 

It is common understanding that near-IR light is a better tracer 
of mass in galaxies than optical light, and that including near- 
IR photometry helps break degeneracies and estimate ages. 
Particularly illustrative figures on the latter point can be found 



m iBridzius et all (l2008h . However, all these statements rest on 
models that are valid only in the limit of large numbers of stars, 
i.e. not for most of the real star clusters in the Universe. This 
Section re-assesses the role of photometry beyond wavelengths 
of 1 fim in the stochastic context appropriate for small clusters. 

Figure [8] shows the results obtained for the sample of 
Sect. 14.11 when using noise-free UBVI input fluxes. Extinction 
is a free parameter although, as above, the clusters analysed are 
not reddened. Figure [8] compares directly to Fig.O 

With a smaller number of observational constraints, it is 
much easier to obtain statistically acceptable values. 80 % of 
the clusters in the sample now find an acceptable match among 
the standard, continuous population synthesis models (with ex- 
tinction when convenient). 

A very unusual result is that between about 10^ and 10^ yr 
the standard method (based on continuous models) produces bet- 
ter ages when the K data is absent than when it is present. This is 
due to the relative locations of the line of continuous models and 
the regions of high model densities, in colour space (Fig. (4]). At 
intermediate ages the off'set is of one full magnitude in (V - K), 
whereas it remains small in other colours. For a cluster that is 
typically located in the high density region, the nearest continu- 
ous model will produce a decent without need for much red- 
dening in UBVI but large values in UB VIK. The rare but lu- 
minous AGB stars strongly increase the impact of stochasticity 
on the derived properties so that one should think about throw- 
ing K band data away when the UB Vl-estimated age is between 
100 and 500 Myr. 

At other ages, the standard method applied to UBVI fluxes 
shows artefacts: a very strong accumulation is found at estimated 
ages around 1 Gyr, and hardly any cluster is assigned ages be- 
tween 2 and 7 Gyr. But these artefacts are not much worse than 
those seen in the analysis of UB VIK data with continuous mod- 
els. 

In terms of estimated masses, the distributions around the 
correct values are similar whether one uses UBVI or UBVIK 
data as input to the standard method (continuous population syn- 
thesis). On average, the masses of the clusters for which the 
UBVI fit is satisfactory are underestimated slightly (0.2 dex at 
10^ Mo). 

When using the Bayesian analysis based on stochastic mod- 
els, the loss of the K band information translates into stronger 
artefacts in the derived age distributions. The model-density dis- 
tribution in colour space plays a more important role in Eq. (O 
when there is less contrast in the distribution, i.e. when K 
fluxes are absent. Ages in high density regions along the de- 
reddening lines become more attractive. Note that these artefacts 
disappear if the amount of reddening is known rather than being 
a free parameter. 

The artefacts in the age distributions of the standard and the 
stochastic method, without the K band, appear oddly similar (left 
panels of Fig. [8]). In fact, the clusters in features l.b and I.JS are 
mostly identical, while the clusters in features 7. a and 7. a form 
two subsets with very little overlap. 

One must distinguish three main behaviours for clusters that 
have ages between 1 and 10 Gyr. Feature 7. a is due to the clus- 
ters located in the main high-density region of model colour- 
space, i.e. the region that runs parallel to the line of old con- 
tinuous models in the lower right panel of Fig. |4lbut is not su- 
perimposed on that line. These clusters will be assigned correct 
ages with the Bayesian method, but an age around 1 Gyr (and 
a positive extinction) with the standard method. Feature 7. a is 
due to clusters located in the secondary high-density region of 
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Fig. 6. Derived age (left) and mass (right) distributions using Bayesian method analysing UBVIK data when 1 mag extinction 
(assuming Cardelli extinction law) is applied to the input population properties. This figure compares directly with Fig. [5] 



colour-space, which happens to be superimposed on the line of 
old continuous models (when there is no K band data). These 
clusters will be assigned correct ages with the standard method, 
but will be attracted to younger ages with the Bayesian method 
(with a positive extinction) because of the contrast in the model- 
density maps. Finally, the rarer clusters with larger than typical 
V - I colours will be attracted to ages around 100 Myr (with 
a positive extinction), both with the standard and the Bayesian 
methods (features l.b and I.JS). 

As noted earlier, underestimated ages lead to underestimated 
masses because stellar populations fade with time and extinc- 
tion corrections do not quite compensate for this. In our sam- 
ple, many of the old clusters aff'ected by the age artefacts get 
assigned masses near the lower limit (500 M©) of our catalog 
(see Sect.lO). 

Outside the main age artefacts seen for old clusters (when 
extinction is a free parameter), the Bayesian methods recovers 
ages and masses similarly well whether or not the K band fluxes 
are included in the input data. If we exclude objects lying in 
the features 7. a and l.b as well as objects afl'ected by the low- 
mass limit of our MC catalog, masses recovered with UB VI data 
sets are not significantly more dispersed (cr = 0.14) than those 
obtained with UBVIK data sets (cr = 0.13). Further aspects of 
photometric band-pass selection are discussed in Sect. 16.31 



6. Discussion 

Several decades have passed si nce the stoc hasticity of s tellar 
populations was first mentioned (iBarbaro & Bertelh 19771) as a 
serious issue. With powerful computers, stochasticity can now 
be taken into account explicitly when analysing the properties of 
unresolved populations. 

Considering stellar populations as stochastic requires some 
changes in habits, for instance because mass is not a simple scal- 
ing factor anymore: errors on absolute fluxes (e.g. due to un- 
certainties on distances) can afl'ect estimated ages; and observa- 
tions of only colours can provide some information on the mass. 
Clearly, more work is needed to explore the consequences of 
stochasticity more exhaustively. 



6.1. Non-stochastic studies : mass and age distributions 

The analysis of the integrated light of small clusters with 
continuous population synthesis models produces strong age- 
dependent artefacts in the derived ages, and a broad distribution 
of random errors in the derived masses. 

The good news is that there is no large systematic ofl'set be- 
tween estimated and real masses if the photometric data are of 
good quality and if observations that cannot find a statistically 
acceptable match among the continuous models are rejected. 
When the data have larger errors, fewer model fits are rejected. 
Then, a systematic trend appears in addition to the random er- 
rors: masses are underestimated on average. The value of the 
off'set depends on the photometric pass-bands available and on 
the existence (or not) of independent information on extinction. 
The average error is of 0.3 dex for a sample of clusters of masses 
around 10^ M© observed in UBVIK, when extinction is treated 
as a free parameter. 

The above means that mass distributions of large samples of 
clusters based on an analysis with continuous population syn- 
thesis models are probably not too strongly biased. Several em- 
pirica l determinations in the literature (see iRafelski & ZaritskyI 
l2005l and references therein) favour mass distributions, and 
we used such a law as a prior in our Bayesian analysis. We find 
no immediate reason to apply a correction to this result, although 
a detailed investigation of each individual dataset in the literature 
would be worthwhile. If, for instance, small masses are system- 
atically underestimated by 0.3 dex while large masses are not, 
the power law index of the mass distribution would require a 
small correctiorQ. 

Figure [9] shows the age-mass distribution obtained when 
UBVI data for our test sample of clusters is analysed with non- 
stochastic population synthesis models and no fit is rejected. 
While the properties of the test sample are smoothly distributed, 
the estimated properties are highly clustered. The figure looks 
similar with UBVIK data, but we chose UBVI for compari- 
son with the (UBVI-FHa)-based ag e-luminosity distribution of 
(more massive) Antennae clusters (iFall. Chandar. & Whitmord 
2005). The right panel of Fig. [9] shows the derived distribution 
in the age-luminosity plane, after conversion of the estimated 

^ We have not invest igated any systematics for large cluster masses. 
See lAnders et all (120041) for that regime. 
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Fig. 7. Estimations based on U, B, V, I and K band fluxes with 5% noise added to the photometry inputs, allowing an extinction Ay 
from to 3. Erom top to bottom, the direct fit of the noised populations by our catalog, the Bayesian estimations and the standard 
estimations. The conventions and axis significations are the same as Eig.[5l 



masses into luminosities with the age-dependent mass-to-light 
ratio given by Pegase. In an observational context, such a fig- 
ure would be truncated at low luminosity by some instrumental 
sensitivity limit. 

The sin iilarity between the distribution in Eig. [9] and that 
in Eig. 1 of lEall. Chandar. & Whitmor^ (l2005l) is striking, even 



more so when one mentally corrects for the sharp low-mass 
limit of our sample and our relative lack of young clusters. 
Accumulations and gaps occur at essentially the same ages (dif- 
ferences can be traced back to the diff'erent sets of evolutionary 
tracks used by the authors). We have discussed the origins of the 
artefacts in our test-sample age distribution in Sect.lH they arise 
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because real, stochastic clusters of small masses are distributed 
over a wide range of colours, and can therefore fall quite far off' 
on the line of continuous models when only extinction is avail- 
able to alter their colours. The clusters in the Antennae sample, 
however, are typically 50 times more luminous than the ones in 
our study. More massive clusters should lie closer to the lines 
of continuous models. Why then are these artefacts so strongly 
seen in the Antennae cluster distribution? Our interpretation is 
that the combined eff'ects of observational errors and real extinc- 
tion disperse the clusters enough in colour- space to produce a 
global derived age distribution that is similar to the distribution 
produced at lower masses by stochasticity. 

The input age distribution of the clusters in our test- sample 
(and in our main MC catalog of clusters) is constant in loga- 
rithmic age bins, and this produces derived age-luminosity dis- 
tributions that are very similar to the ones in the literature. Our 
tentative conclusion is that the adopted age-distribution is, for 
the time being, an adequate prior for the Bayesian, stochastic 
studies. Clearly, it would be desirable to explore quantitatively 
how sensitive the derived distributions are to the actual age and 
mass distributions, especially in the presence of observational er- 
rors. Any firm conclusion on the Antennae clusters, for instance, 
would require such a study. 



6.2. Prospects of the stochastic analysis 

The analysis of cluster colours based on a library of stochastic 
models provides ages and masses with small random errors, and 
with systematics that will be negligible in many situations. 

Figure \TU\ shows the derived age-mass distributions for our 
test sample, based on either UBVI or UBVIK data sets. Similar 
results are obtained if the input sample is reddened (Fig.[TT]). The 
situation can be improved even more if there are independent 
constraints on extinction. 

The figures show the grid pattern due to the finite sizes of our 
bins in estimated ages and masses. This is because we compute 
the posterior probabilities of Eq.|4]for finite intervals. 

The binning procedure has a caveat and an advantage. The 
caveat is that results are sometimes sensitive to the choice of 
the bin sizes or boundaries. We have looked at many probability 
maps in 2D projections of age-mass-extinction space to check 
where this occurs. Examples are given in Appendix lAl (all can 
not be developed here). As expected, the results are robust with 
respect to the binning (i.e. errors are smaller than or equal to on e 
bin size) when the maps are simple, single peaked (Fig. lA.lb , 
while they can depend strongly on binning when the maps are 
complex (i.e. in cases in which uncertainty estimates based on 
contours of equal probability would give large error bars). The 
second type of situation occurs especially when there is no a pri- 
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ori information on extinction. Figure IA.2I illustrates this event: 
the dereddening vector of an object crosses several regions of 
high model-density. The model properties within each of these 
regions end up with similar probabilities. Which one dominates, 
depends on the binning as shown on Fig. lA.3l (and on the density 
pattern itself). 

The advantage of binning is that we can, to first order, assign 
the ages, metallicities and extinction of a given bin the same a 
priori probabilities. This allows us to rescale the derived proba- 
bility maps for a cluster after calculation, if we wish to change 
the adopted priors (a worthwhile gain in computation time). In 
our current catalog, the total number of clusters with masses 
above about 10"^ M© is too small to allow us to study flatter mass 
distributions of clusters immediately, and we also lack clusters 
with ages below 10^ yr. We have therefore chosen to postpone 
the description of consequences of changes in the priors to a fu- 
ture article. 



6.2.1 . Other practical aspects 

Although conceptually simple, the implementation of the 
Bayesian, stochastic analysis is not immediate. Constructing the 
stochastic library requires CPU time and data storage. For each 
cluster, the spectra of at least the brightest 1000 stars need to 
be summed individually. The remaining low mass stars could 
be represented with a continuous sequence to save computation 
time. Synthetic broad band colours can then be measured on 
the resulting total spectrum. We decided to save all the spectra 
and to compute a set of 60 standard broad band fluxes (standard 
UB VRIJHK, but also most HST filters and some from other tele- 
scopes). For three metallicities and several millions of models 
in total, the computation took a couple of month on six 3 GHz 
CPUs, and half a terabyte of data was accumulated. 

Browsing through the collection to compute probabilities 
also takes time, even with the restriction to a few photometric 
bands. A few CPU-hours are required to construct probability 
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distributions of age, mass and extinction for 1000 clusters with 
our main MC catalog, allowing for extinction to vary in the range 
described above. It is somewhat faster to use a simple best-;^^. 
This may be good enough for statistical studies of large sam- 
ples, but it must be kept in mind that the single best fit can be 
far from the Bayesian most probable model in individual cases. 
Very small changes in the observed colours (due to small obser- 
vational errors) can modify single best fit parameters, while they 
leave Bayesian estimates unchanged. To provide error bars on 
the single best fit parameters, one has to explore the vicinity of 
the minimum For instance, one could compare the properties 
of all the models with a below a predefined t hre shold, as was 
done for large clusters in lBridzius et al.l (l2008l) or I Anders et al.l 
(|2004|) . Doing this exhaustively would be conceptually similar, 
and essentially as long to run, as our Bayesian approach. 

All the results in this paper are based on one single set of 
population synthesis models. Some caveats of these models are 
that they do not include the formation of carbon stars or Mira- 
type variability, and that they do not account for binaries or 
for stellar rotation. We have not yet compared the amplitude of 
the errors arising from stochasticity (studied here) with those 



due to uncertainties in the physical assumptions of the models. 
However, we expect qualitatively similar systematics with any 
set of stellar evolution tracks. It will unfortunately be necessary 
to recompute a completely new library if one wishes to explore 
diff'erent sets of stellar evolution tracks, to assume a diff'erent 
stellar mass function, or to use a diff'erent library of stellar spec- 
tra. This will clearly limit the pace at which the comparisons can 
be made. 

6.3. Impact of the photometric band selections 

In the present paper, the estimation of ages and masses has been 
shown to be aff'ected by stochasticity and also by the photo- 
metric datasets used during the analysis. We chose to focus on 
explaining behaviours rather than multiplying experiments with 
diff'erent pass-bands, and have therefore restricted our quanti- 
tative comparisons to two combinations. In a large fraction of 
stellar population studies, the UBVI combination is used as the 
standard selection: it provides a reasonable compromise between 
depth and spectral coverage for most of the available instru- 
ments. Moreover, it corresponds to a wavelength range where 
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and the F218W filter, centered around 220 nm (WFPC2 instru- 
ment on-board the Hubble Space Telescope). A brief summary 
is that the eff'ects of adding this UV band have amplitudes simi- 
lar to those obtained when adding K. The dispersions in the re- 
sults are comparable to those without UV, both for Bayesian or 
standard estimates. Artefacts are also qualitatively similar. Age 
estimates below 20 Myr are improved only slightly. 

If one however wants to include UV bands, the choice of an 
extinction law becomes more of an issue. Extinction laws i ndeed 
express a range of be haviours in the UV bands (e.glAUenl 
Fitzpatrick & Mass3 Il986l: ICardelli. Clavton. & MathisI 
ICalzetti et al.li200Qk . 

The decision of which filters to use rests on considerations 
of the astrophysical problem to tackle. Using either UV or K 
photometry does not deeply aff'ect the resulting estimates ages 
and masses of small clusters. One may first want to consider 
using discrete population models instead of continuous ones. 
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Fig. 11. Resulting age-mass distributions from the Bayesian 
analysis methods on the 1-mag reddened test-sample clusters. 
Grey dots are estimates from UBVIK bands whereas black dots 
are from UB VI bands only. Corresponding marginal distribution 
histograms follow the same color conventions. The dotted line 
on the histograms represent the test- sample distribution. 

spectral libraries are the most accurate. Adding the K band im- 
proves mass estimates in the non- stochastic regime of high total 
masses, but it has an observational cost as it requires the use of 
a diff'erent instrument. Testing the UBVIK combination in the 
stochastic regime provides useful information for the design of 
future surveys. 

In Sect, m we presented some eff'ects of the presence of K 
band data in addition to UBVI photometry. The K band does 
not improve significantly the situation when continuous models 
are being used (right panel on Fig. [T0|. In particular, we men- 
tioned that including the K band in the analysis of populations 
with AGB stars (100 - 500 Myr) may lead to even worse esti- 
mates (again with continuous models). This eff'ect is also visi- 
ble on the left-hand side panels of Fig. [121 On the other hand, 
we found only mild improvements when including K band data 
in the Bayesian analysis: the presence of K band information 
translates into weaker artefacts in the derived age distributions 
(artefacts which disappear when reddening is not a free param- 
eter). The left panel of Fig. [TO] presents recovered age-mass dis- 
tributions for UBVI and UBVIK datasets using the Bayesian ap- 
proach, with free extinction. Old populations (> 1 Gyr) age- 
mass estimates reflect the original distribution better when K 
band constraints are included. This is also visible on the right 
panels of Fig. [121 where the resulting distributions are signifi- 
cantly improved. 

At young ages, both our test sample and our main MC cata- 
log are underpopulated. Most small and young clusters have HR 
diagrams that resemble a truncated main sequence. Even if the 
stellar mass function from which their stars are drawn randomly 
extends to I2OM0, a small zero age cluster will in general con- 
tain zero ionizing stars, and a small cluster aged 10 Myr will 
contain zero red supergiants : two such objects are basically in- 
distinguishable, be it with or without K band data. This problem 
shows up clearly when we run the Bayesian analysis at young 
ages, but quantifying this eff'ect requires that we add more young 
clusters to our reference catalog. 

The UV spectral range raises questions similar to the near- 
infrared. We repeated some of the above experiments with UBVI 



7. Conclusions 

Studies of star cluster populations in galaxies have been based 
until now on continuous population synthesis models, that pro- 
vide a very poor approximation of the integrated light of clus- 
ters of small and intermediate masses because this light is deter- 
mined by a very small number of luminous stars. 

Based on large collections of Monte-Carlo simulations of 
star clusters that each contain a finite number of stars, this pa- 
per explores systematic errors that occur when the integrated 
fluxes of realistic clusters of small masses are analysed in terms 
of mass and age. Our ma in collection is built with the cluste r age 
and mass distributions of lMT Chandar. & Whitmord (120091) . ex- 
trapolated to masses lower than those observed in the Antennae 
galaxies. 

With the standard methods (continuous models), large sys- 
tematic errors afl'ect estimated ages and large random errors af- 
fect masses. If observational uncertainties on cluster fluxes are 
large and, as a consequence, quality-of-fit criteria fail to reject 
the numerous poor fits, systematic errors (of a few tenths of a 
dex) are also present in the estimated masses. Derived age-mass 
or age-luminosity distributions for samples in which actual ages 
and masses are distributed as in our main collection display clus- 
tered patterns that very closely resemble those found in empir- 
ical samples in the current literature. We find no immediately 
obvious reason to reject the age and mass distributions of our 
main collection, but clearly this essential point requires detailed 
study with real observations. 

A Bayesian method has been described and implemented, in 
order to account explicitly for the finite nature of clusters in the 
analysis. It is shown that their age and mass can be recovered 
with error bars that will be small enough for many purposes. 
Young small mass clusters will remain difficult to age-date be- 
cause HR-diagrams of many of them identically look like trun- 
cated main sequences with no ionizing or post-main sequence 
stars. At intermediate ages, the variability of luminous AGB 
stars is expected to cause difficulties that we have not yet solved. 

The comparison between the results obtained with UBVI and 
UBVIK data sets shows that, in the stochastic context, the bene- 
fits of adding the K band to optical observations are rather small, 
except for the mass determination of clusters older than 1 Gyr. 
Clearly, adding near-IR or UV information is secondary, com- 
pared to the need to move from continuous to stochastic cluster 
models. 

The Bayesian analysis method can now be applied to exist- 
ing data on cluster samples in nearby galaxies with the aim of 
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constraining the actual age and mass distributions of these clus- 
ters. We will also extend the study of systematic errors to the 
case where metallicity is an unknown parameter. 

Acknowledgements. We are grateful to Michel Fioc for kindly providing his 
Fortran 90 version of Pegase, and to M. Fioc and B. Rocca-Volmerange for their 
support to the implementation of discrete population synthesis. 
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ments and uncertainties, we establish the joint probability distri- 
butions of these parameters based on a large catalog of Monte- 
Carlo simulations. In this appendix, we present examples of 
probability distributions. 

First an example is given on Fig. lA.U for which we obtain 
single peaked probability distributions leading to an unambigu- 
ous probabilistic determination of age, mass and extinction esti- 
mates. In this example, we recover that the most likely have no 
extinction and the most probable age and mass are 35 Myr and 
630 Mq, very close to the expected values. 

As we discussed in Sect. 16.21 it also happens that the prob- 
ability distributions are not single peaked. Complex maps oc- 
cur when there is no independent known information on the 
amount of extinction. Figure lAl2l illustrates how the extinction 
factor increases the complexity of the probability maps. The 
top-left panel contours show how the color distributions are af- 
fected along the dereddening vector. On this example, the object 
crosses three regions of high model density. The resulting age- 
mass probability distribution is multi-modal (trimodal). 

Complex distributions are subject to binning issues. The ex- 
ample probability distributions given on Fig. IA.2I yields three 
modes with similar probabilities. If one changes the number of 
bins on which the probabilities are computed, peak values might 
vary to eventually change the dominant mode of the distribu- 
tions. The Figure lA3l shows that the variations of the resulting 
age estimates can be significant: from 2.3 Gyr to 50 Myr . A 
determination of the optimal binning remains an open issue. 



Appendix A: Examples of probability maps 

The method we developed to take the stochasticity into account 
to estimate intrinsic parameters (age, total number of stars or 
total mass, metallicity, extinction) of unresolved population fol- 
lows a Bayesian approach. Given a set of photometric measure- 
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Fig. A.l. Example of probability map for a population of 33 Myr and 650 M© based on UBVIK band data, allowing extinction to 
vary between Ay = and Ay = 3. On the top-left panel, contours use an arbitrary scale to represent the colour joint probability 
distribution derived from the measurements (i.e. Eq.|4l applied to colours); the colour scale refers to model density. The white circle 
indicates the locus of the studied population (error bars of 0.05 mag are not indicated). On the other panels, the colour scale refers 
to the corresponding 2D probability distributions. They are underlined by contours with arbitrary levels. The white circles indicate 
the position of the highest probability. 
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Fig. A.2. Example of probability maps for a population of 2.3 Gyr and 780 M© based on UBVIK band. Symbols and contours are 
the same as on Eig. lA.ll Age and mass estimates have been computed over 40 bins (refer to the binning issue described in Sect. 16.21 
and also related to Eig. lA.3l) . 
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Fig. A.3. Illustration of the effect of binning using the cluster of Eig. lA.2[ On left-hand side panel the correct age is recovered with 
40 bins, however as the right panel illustrates, a different binning (50 bins) favours a much younger age with (erroneously) high 
extinction. The cluster is now moved to feature [3 in EigJSl 



